Dataset statistics
| Number of variables | 20 |
|---|---|
| Number of observations | 2666 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 398.5 KiB |
| Average record size in memory | 153.0 B |
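The memory figures above are related by simple arithmetic. The sketch below recomputes the average record size from the total size and row count. It also checks that one 8-byte value per row gives the 20.8 KiB memory size listed for most variables in this report. The 8-bytes-per-value layout is an assumption; the report does not state the column dtypes.

```python
# Figures copied from the "Dataset statistics" table.
n_observations = 2666
total_size_kib = 398.5

# Average record size = total bytes / number of rows.
avg_record_bytes = total_size_kib * 1024 / n_observations
print(f"average record size ~ {avg_record_bytes:.2f} B")

# Assumption (not stated in the report): a column stores 8 bytes per row.
column_kib = n_observations * 8 / 1024
print(f"one 8-byte column ~ {column_kib:.1f} KiB")
```

The recomputed record size differs from the reported 153.0 B only in the second decimal, which is consistent with the total size itself being rounded to one decimal place.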
Variable types
| NUM | 15 |
|---|---|
| BOOL | 3 |
| CAT | 2 |
Warnings
| Warning | Type |
|---|---|
| State has a high cardinality: 51 distinct values | High cardinality |
| Total day charge is highly correlated with Total day minutes | High correlation |
| Total day minutes is highly correlated with Total day charge | High correlation |
| Total eve charge is highly correlated with Total eve minutes | High correlation |
| Total eve minutes is highly correlated with Total eve charge | High correlation |
| Total night charge is highly correlated with Total night minutes | High correlation |
| Total night minutes is highly correlated with Total night charge | High correlation |
| Total intl charge is highly correlated with Total intl minutes | High correlation |
| Total intl minutes is highly correlated with Total intl charge | High correlation |
| Number vmail messages has 1933 (72.5%) zeros | Zeros |
| Customer service calls has 555 (20.8%) zeros | Zeros |
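The high-correlation warnings pair each charge column with its minutes column. A quick way to see why is to divide charge by minutes for the sample rows printed at the end of this report. The sketch below does this for the first ten daytime rows. The reading that charge is simply minutes multiplied by a fixed rate of about 0.17 is an inference from those rows, not something the report states.

```python
# (Total day minutes, Total day charge) pairs from the first ten sample rows.
day_pairs = [
    (265.1, 45.07), (161.6, 27.47), (243.4, 41.38), (299.4, 50.90),
    (166.7, 28.34), (223.4, 37.98), (218.2, 37.09), (157.0, 26.69),
    (258.6, 43.96), (187.7, 31.91),
]

# If charge is a fixed multiple of minutes, every ratio should be nearly equal.
ratios = [charge / minutes for minutes, charge in day_pairs]
print(f"min ratio = {min(ratios):.5f}, max ratio = {max(ratios):.5f}")
```

If the ratio is constant across the full dataset as well, each charge column carries no information beyond its minutes column, so one of each pair could be dropped before modelling.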
Reproduction
| Analysis started | 2021-01-31 06:26:24.342885 |
|---|---|
| Analysis finished | 2021-01-31 06:27:45.520150 |
| Duration | 1 minute and 21.18 seconds |
| Software version | pandas-profiling v2.9.0 |
State
Categorical
| Distinct | 51 |
|---|---|
| Distinct (%) | 1.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.8 KiB |
| Value | Count | Frequency (%) | |
| WV | 88 | 3.3% | |
| MN | 70 | 2.6% | |
| NY | 68 | 2.6% | |
| VA | 67 | 2.5% | |
| WY | 66 | 2.5% | |
| OH | 66 | 2.5% | |
| AL | 66 | 2.5% | |
| OR | 62 | 2.3% | |
| WI | 61 | 2.3% | |
| NV | 61 | 2.3% | |
| Other values (41) | 1991 | 74.7% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Account length
Real number (ℝ≥0)
| Distinct | 205 |
|---|---|
| Distinct (%) | 7.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.6204051 |
|---|---|
| Minimum | 1 |
| Maximum | 243 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 36 |
| Q1 | 73 |
| median | 100 |
| Q3 | 127 |
| 95-th percentile | 166 |
| Maximum | 243 |
| Range | 242 |
| Interquartile range (IQR) | 54 |
Descriptive statistics
| Standard deviation | 39.56397365 |
|---|---|
| Coefficient of variation (CV) | 0.3932003018 |
| Kurtosis | -0.1383128669 |
| Mean | 100.6204051 |
| Median Absolute Deviation (MAD) | 27 |
| Skewness | 0.07902340636 |
| Sum | 268254 |
| Variance | 1565.308011 |
| Monotonicity | Not monotonic |
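Several of the descriptive statistics are derived from others in the same tables. As a small consistency check, the sketch below recomputes the mean, variance, coefficient of variation, range and IQR for Account length from the reported values. It uses only figures that appear in the tables above; the variable names are illustrative.

```python
import math

# Values copied from the Account length tables.
n, total = 2666, 268254
mean, std = 100.6204051, 39.56397365
minimum, q1, q3, maximum = 1, 73, 127, 243

derived = {
    "mean": total / n,           # sum / number of observations
    "variance": std ** 2,        # square of the standard deviation
    "cv": std / mean,            # coefficient of variation
    "range": maximum - minimum,
    "iqr": q3 - q1,              # interquartile range
}
for name, value in derived.items():
    print(f"{name}: {value:.10g}")
```

The same relations hold for every numeric variable in the report, so they are a cheap way to spot transcription errors.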
| Value | Count | Frequency (%) | |
| 93 | 35 | 1.3% | |
| 87 | 33 | 1.2% | |
| 105 | 33 | 1.2% | |
| 101 | 32 | 1.2% | |
| 99 | 32 | 1.2% | |
| 100 | 31 | 1.2% | |
| 116 | 29 | 1.1% | |
| 106 | 29 | 1.1% | |
| 98 | 29 | 1.1% | |
| 90 | 29 | 1.1% | |
| Other values (195) | 2354 | 88.3% |
| Value | Count | Frequency (%) | |
| 1 | 6 | 0.2% | |
| 2 | 1 | < 0.1% | |
| 3 | 4 | 0.2% | |
| 4 | 1 | < 0.1% | |
| 5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 243 | 1 | < 0.1% | |
| 225 | 2 | 0.1% | |
| 224 | 2 | 0.1% | |
| 221 | 1 | < 0.1% | |
| 217 | 1 | < 0.1% |
Area code
Categorical
| Distinct | 3 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.8 KiB |
| Value | Count | Frequency (%) | |
| 415 | 1318 | 49.4% | |
| 510 | 679 | 25.5% | |
| 408 | 669 | 25.1% |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
International plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.8 KiB |
| Value | Count | Frequency (%) | |
| No | 2396 | 89.9% | |
| Yes | 270 | 10.1% |
Voice mail plan
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 20.8 KiB |
| Value | Count | Frequency (%) | |
| No | 1933 | 72.5% | |
| Yes | 733 | 27.5% |
Number vmail messages
Real number (ℝ≥0)
| Distinct | 42 |
|---|---|
| Distinct (%) | 1.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 8.021755439 |
|---|---|
| Minimum | 0 |
| Maximum | 50 |
| Zeros | 1933 |
| Zeros (%) | 72.5% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 19 |
| 95-th percentile | 36 |
| Maximum | 50 |
| Range | 50 |
| Interquartile range (IQR) | 19 |
Descriptive statistics
| Standard deviation | 13.61227702 |
|---|---|
| Coefficient of variation (CV) | 1.696919972 |
| Kurtosis | -0.04015788882 |
| Mean | 8.021755439 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.271773633 |
| Sum | 21386 |
| Variance | 185.2940856 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0 | 1933 | 72.5% | |
| 31 | 50 | 1.9% | |
| 28 | 42 | 1.6% | |
| 29 | 39 | 1.5% | |
| 24 | 37 | 1.4% | |
| 33 | 37 | 1.4% | |
| 30 | 35 | 1.3% | |
| 27 | 34 | 1.3% | |
| 25 | 33 | 1.2% | |
| 32 | 33 | 1.2% | |
| Other values (32) | 393 | 14.7% |
| Value | Count | Frequency (%) | |
| 0 | 1933 | 72.5% | |
| 4 | 1 | < 0.1% | |
| 8 | 2 | 0.1% | |
| 9 | 2 | 0.1% | |
| 10 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 50 | 2 | 0.1% | |
| 47 | 3 | 0.1% | |
| 46 | 3 | 0.1% | |
| 45 | 4 | 0.2% | |
| 44 | 7 | 0.3% |
Total day minutes
Real number (ℝ≥0)
| Distinct | 1489 |
|---|---|
| Distinct (%) | 55.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 179.4816204 |
|---|---|
| Minimum | 0 |
| Maximum | 350.8 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 90.425 |
| Q1 | 143.4 |
| median | 179.95 |
| Q3 | 215.9 |
| 95-th percentile | 269.775 |
| Maximum | 350.8 |
| Range | 350.8 |
| Interquartile range (IQR) | 72.5 |
Descriptive statistics
| Standard deviation | 54.21035022 |
|---|---|
| Coefficient of variation (CV) | 0.3020384488 |
| Kurtosis | 0.01936427966 |
| Mean | 179.4816204 |
| Median Absolute Deviation (MAD) | 36.25 |
| Skewness | -0.05310559809 |
| Sum | 478498 |
| Variance | 2938.762071 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 162.3 | 7 | 0.3% | |
| 183.4 | 7 | 0.3% | |
| 194.8 | 6 | 0.2% | |
| 175.4 | 6 | 0.2% | |
| 159.5 | 6 | 0.2% | |
| 185 | 6 | 0.2% | |
| 216 | 6 | 0.2% | |
| 145 | 5 | 0.2% | |
| 124.3 | 5 | 0.2% | |
| 141.3 | 5 | 0.2% | |
| Other values (1479) | 2607 | 97.8% |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 2.6 | 1 | < 0.1% | |
| 7.8 | 1 | < 0.1% | |
| 7.9 | 1 | < 0.1% | |
| 12.5 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 350.8 | 1 | < 0.1% | |
| 346.8 | 1 | < 0.1% | |
| 345.3 | 1 | < 0.1% | |
| 337.4 | 1 | < 0.1% | |
| 335.5 | 1 | < 0.1% |
Total day calls
Real number (ℝ≥0)
| Distinct | 115 |
|---|---|
| Distinct (%) | 4.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.3102026 |
|---|---|
| Minimum | 0 |
| Maximum | 160 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 101 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 160 |
| Range | 160 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 19.98816219 |
|---|---|
| Coefficient of variation (CV) | 0.1992635014 |
| Kurtosis | 0.2895491547 |
| Mean | 100.3102026 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | -0.1282668464 |
| Sum | 267427 |
| Variance | 399.5266276 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 105 | 62 | 2.3% | |
| 106 | 59 | 2.2% | |
| 108 | 59 | 2.2% | |
| 112 | 58 | 2.2% | |
| 107 | 57 | 2.1% | |
| 102 | 57 | 2.1% | |
| 100 | 56 | 2.1% | |
| 104 | 55 | 2.1% | |
| 95 | 55 | 2.1% | |
| 88 | 54 | 2.0% | |
| Other values (105) | 2094 | 78.5% |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 36 | 1 | < 0.1% | |
| 40 | 1 | < 0.1% | |
| 42 | 2 | 0.1% | |
| 44 | 3 | 0.1% |
| Value | Count | Frequency (%) | |
| 160 | 1 | < 0.1% | |
| 158 | 3 | 0.1% | |
| 157 | 1 | < 0.1% | |
| 156 | 1 | < 0.1% | |
| 152 | 1 | < 0.1% |
Total day charge
Real number (ℝ≥0)
| Distinct | 1489 |
|---|---|
| Distinct (%) | 55.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 30.51240435 |
|---|---|
| Minimum | 0 |
| Maximum | 59.64 |
| Zeros | 2 |
| Zeros (%) | 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 15.375 |
| Q1 | 24.38 |
| median | 30.59 |
| Q3 | 36.7 |
| 95-th percentile | 45.865 |
| Maximum | 59.64 |
| Range | 59.64 |
| Interquartile range (IQR) | 12.32 |
Descriptive statistics
| Standard deviation | 9.215732907 |
|---|---|
| Coefficient of variation (CV) | 0.3020323407 |
| Kurtosis | 0.01950186757 |
| Mean | 30.51240435 |
| Median Absolute Deviation (MAD) | 6.16 |
| Skewness | -0.0530869042 |
| Sum | 81346.07 |
| Variance | 84.92973302 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 27.59 | 7 | 0.3% | |
| 31.18 | 7 | 0.3% | |
| 31.45 | 6 | 0.2% | |
| 29.82 | 6 | 0.2% | |
| 36.72 | 6 | 0.2% | |
| 33.12 | 6 | 0.2% | |
| 27.12 | 6 | 0.2% | |
| 24.65 | 5 | 0.2% | |
| 33.73 | 5 | 0.2% | |
| 35.29 | 5 | 0.2% | |
| Other values (1479) | 2607 | 97.8% |
| Value | Count | Frequency (%) | |
| 0 | 2 | 0.1% | |
| 0.44 | 1 | < 0.1% | |
| 1.33 | 1 | < 0.1% | |
| 1.34 | 1 | < 0.1% | |
| 2.13 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 59.64 | 1 | < 0.1% | |
| 58.96 | 1 | < 0.1% | |
| 58.7 | 1 | < 0.1% | |
| 57.36 | 1 | < 0.1% | |
| 57.04 | 1 | < 0.1% |
Total eve minutes
Real number (ℝ≥0)
| Distinct | 1442 |
|---|---|
| Distinct (%) | 54.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 200.386159 |
|---|---|
| Minimum | 0 |
| Maximum | 363.7 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 118.725 |
| Q1 | 165.3 |
| median | 200.9 |
| Q3 | 235.1 |
| 95-th percentile | 285.025 |
| Maximum | 363.7 |
| Range | 363.7 |
| Interquartile range (IQR) | 69.8 |
Descriptive statistics
| Standard deviation | 50.95151512 |
|---|---|
| Coefficient of variation (CV) | 0.2542666388 |
| Kurtosis | -0.02549313226 |
| Mean | 200.386159 |
| Median Absolute Deviation (MAD) | 35 |
| Skewness | -0.01266524296 |
| Sum | 534229.5 |
| Variance | 2596.056893 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 169.9 | 8 | 0.3% | |
| 220.6 | 7 | 0.3% | |
| 167.2 | 7 | 0.3% | |
| 161.7 | 7 | 0.3% | |
| 181.6 | 6 | 0.2% | |
| 195.5 | 6 | 0.2% | |
| 194 | 6 | 0.2% | |
| 224.9 | 6 | 0.2% | |
| 205.1 | 6 | 0.2% | |
| 209.4 | 6 | 0.2% | |
| Other values (1432) | 2601 | 97.6% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 31.2 | 1 | < 0.1% | |
| 42.2 | 1 | < 0.1% | |
| 42.5 | 1 | < 0.1% | |
| 43.9 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 363.7 | 1 | < 0.1% | |
| 354.2 | 1 | < 0.1% | |
| 350.9 | 1 | < 0.1% | |
| 348.5 | 1 | < 0.1% | |
| 347.3 | 1 | < 0.1% |
Total eve calls
Real number (ℝ≥0)
| Distinct | 120 |
|---|---|
| Distinct (%) | 4.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.0236309 |
|---|---|
| Minimum | 0 |
| Maximum | 170 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 67 |
| Q1 | 87 |
| median | 100 |
| Q3 | 114 |
| 95-th percentile | 133 |
| Maximum | 170 |
| Range | 170 |
| Interquartile range (IQR) | 27 |
Descriptive statistics
| Standard deviation | 20.16144512 |
|---|---|
| Coefficient of variation (CV) | 0.2015668191 |
| Kurtosis | 0.1893960643 |
| Mean | 100.0236309 |
| Median Absolute Deviation (MAD) | 13.5 |
| Skewness | -0.06520928393 |
| Sum | 266663 |
| Variance | 406.4838691 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 105 | 64 | 2.4% | |
| 94 | 62 | 2.3% | |
| 109 | 58 | 2.2% | |
| 102 | 56 | 2.1% | |
| 108 | 55 | 2.1% | |
| 87 | 54 | 2.0% | |
| 97 | 54 | 2.0% | |
| 115 | 53 | 2.0% | |
| 111 | 52 | 2.0% | |
| 98 | 52 | 2.0% | |
| Other values (110) | 2106 | 79.0% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 12 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% | |
| 42 | 1 | < 0.1% | |
| 43 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 170 | 1 | < 0.1% | |
| 159 | 1 | < 0.1% | |
| 157 | 1 | < 0.1% | |
| 156 | 1 | < 0.1% | |
| 155 | 2 | 0.1% |
Total eve charge
Real number (ℝ≥0)
| Distinct | 1301 |
|---|---|
| Distinct (%) | 48.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 17.03307202 |
|---|---|
| Minimum | 0 |
| Maximum | 30.91 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 10.0925 |
| Q1 | 14.05 |
| median | 17.08 |
| Q3 | 19.98 |
| 95-th percentile | 24.225 |
| Maximum | 30.91 |
| Range | 30.91 |
| Interquartile range (IQR) | 5.93 |
Descriptive statistics
| Standard deviation | 4.330864177 |
|---|---|
| Coefficient of variation (CV) | 0.2542620716 |
| Kurtosis | -0.0255701337 |
| Mean | 17.03307202 |
| Median Absolute Deviation (MAD) | 2.98 |
| Skewness | -0.01262903519 |
| Sum | 45410.17 |
| Variance | 18.75638452 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 16.12 | 9 | 0.3% | |
| 14.25 | 9 | 0.3% | |
| 14.44 | 8 | 0.3% | |
| 18.96 | 8 | 0.3% | |
| 18.62 | 8 | 0.3% | |
| 17.43 | 8 | 0.3% | |
| 17.99 | 8 | 0.3% | |
| 18.75 | 7 | 0.3% | |
| 16.63 | 7 | 0.3% | |
| 16.97 | 7 | 0.3% | |
| Other values (1291) | 2587 | 97.0% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 2.65 | 1 | < 0.1% | |
| 3.59 | 1 | < 0.1% | |
| 3.61 | 1 | < 0.1% | |
| 3.73 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 30.91 | 1 | < 0.1% | |
| 30.11 | 1 | < 0.1% | |
| 29.83 | 1 | < 0.1% | |
| 29.62 | 1 | < 0.1% | |
| 29.52 | 1 | < 0.1% |
Total night minutes
Real number (ℝ≥0)
| Distinct | 1444 |
|---|---|
| Distinct (%) | 54.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 201.1689422 |
|---|---|
| Minimum | 43.7 |
| Maximum | 395 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 43.7 |
|---|---|
| 5-th percentile | 117.925 |
| Q1 | 166.925 |
| median | 201.15 |
| Q3 | 236.475 |
| 95-th percentile | 283.675 |
| Maximum | 395 |
| Range | 351.3 |
| Interquartile range (IQR) | 69.55 |
Descriptive statistics
| Standard deviation | 50.78032337 |
|---|---|
| Coefficient of variation (CV) | 0.2524262583 |
| Kurtosis | 0.05038227445 |
| Mean | 201.1689422 |
| Median Absolute Deviation (MAD) | 34.8 |
| Skewness | 0.02336249992 |
| Sum | 536316.4 |
| Variance | 2578.641241 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 214.7 | 7 | 0.3% | |
| 172.7 | 6 | 0.2% | |
| 181.2 | 6 | 0.2% | |
| 214.6 | 6 | 0.2% | |
| 193.6 | 6 | 0.2% | |
| 182.1 | 6 | 0.2% | |
| 210 | 6 | 0.2% | |
| 214 | 6 | 0.2% | |
| 197.4 | 6 | 0.2% | |
| 194.3 | 6 | 0.2% | |
| Other values (1434) | 2605 | 97.7% |
| Value | Count | Frequency (%) | |
| 43.7 | 1 | < 0.1% | |
| 45 | 1 | < 0.1% | |
| 47.4 | 1 | < 0.1% | |
| 50.1 | 2 | 0.1% | |
| 53.3 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 395 | 1 | < 0.1% | |
| 381.9 | 1 | < 0.1% | |
| 377.5 | 1 | < 0.1% | |
| 364.9 | 1 | < 0.1% | |
| 364.3 | 1 | < 0.1% |
Total night calls
Real number (ℝ≥0)
| Distinct | 118 |
|---|---|
| Distinct (%) | 4.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 100.1061515 |
|---|---|
| Minimum | 33 |
| Maximum | 166 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 33 |
|---|---|
| 5-th percentile | 68 |
| Q1 | 87 |
| median | 100 |
| Q3 | 113 |
| 95-th percentile | 131 |
| Maximum | 166 |
| Range | 133 |
| Interquartile range (IQR) | 26 |
Descriptive statistics
| Standard deviation | 19.41845855 |
|---|---|
| Coefficient of variation (CV) | 0.1939786742 |
| Kurtosis | -0.04800868253 |
| Mean | 100.1061515 |
| Median Absolute Deviation (MAD) | 13 |
| Skewness | 0.01041040145 |
| Sum | 266883 |
| Variance | 377.0765325 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 105 | 70 | 2.6% | |
| 104 | 67 | 2.5% | |
| 91 | 60 | 2.3% | |
| 102 | 58 | 2.2% | |
| 106 | 58 | 2.2% | |
| 100 | 57 | 2.1% | |
| 96 | 54 | 2.0% | |
| 95 | 53 | 2.0% | |
| 108 | 53 | 2.0% | |
| 98 | 53 | 2.0% | |
| Other values (108) | 2083 | 78.1% |
| Value | Count | Frequency (%) | |
| 33 | 1 | < 0.1% | |
| 36 | 1 | < 0.1% | |
| 38 | 1 | < 0.1% | |
| 42 | 1 | < 0.1% | |
| 44 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 166 | 1 | < 0.1% | |
| 164 | 1 | < 0.1% | |
| 158 | 1 | < 0.1% | |
| 157 | 2 | 0.1% | |
| 156 | 2 | 0.1% |
Total night charge
Real number (ℝ≥0)
| Distinct | 885 |
|---|---|
| Distinct (%) | 33.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.052689422 |
|---|---|
| Minimum | 1.97 |
| Maximum | 17.77 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 1.97 |
|---|---|
| 5-th percentile | 5.31 |
| Q1 | 7.5125 |
| median | 9.05 |
| Q3 | 10.64 |
| 95-th percentile | 12.7675 |
| Maximum | 17.77 |
| Range | 15.8 |
| Interquartile range (IQR) | 3.1275 |
Descriptive statistics
| Standard deviation | 2.285119513 |
|---|---|
| Coefficient of variation (CV) | 0.2524243798 |
| Kurtosis | 0.05008123142 |
| Mean | 9.052689422 |
| Median Absolute Deviation (MAD) | 1.565 |
| Skewness | 0.0233184743 |
| Sum | 24134.47 |
| Variance | 5.221771188 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 9.66 | 13 | 0.5% | |
| 8.88 | 11 | 0.4% | |
| 7.15 | 10 | 0.4% | |
| 9.63 | 10 | 0.4% | |
| 9.14 | 10 | 0.4% | |
| 10.49 | 9 | 0.3% | |
| 8.64 | 9 | 0.3% | |
| 10.35 | 9 | 0.3% | |
| 9.23 | 9 | 0.3% | |
| 10.8 | 9 | 0.3% | |
| Other values (875) | 2567 | 96.3% |
| Value | Count | Frequency (%) | |
| 1.97 | 1 | < 0.1% | |
| 2.03 | 1 | < 0.1% | |
| 2.13 | 1 | < 0.1% | |
| 2.25 | 2 | 0.1% | |
| 2.4 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 17.77 | 1 | < 0.1% | |
| 17.19 | 1 | < 0.1% | |
| 16.99 | 1 | < 0.1% | |
| 16.42 | 1 | < 0.1% | |
| 16.39 | 1 | < 0.1% |
Total intl minutes
Real number (ℝ≥0)
| Distinct | 158 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 10.23702176 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros | 15 |
| Zeros (%) | 0.6% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.8 |
| Q1 | 8.5 |
| median | 10.2 |
| Q3 | 12.1 |
| 95-th percentile | 14.7 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3.6 |
Descriptive statistics
| Standard deviation | 2.788348577 |
|---|---|
| Coefficient of variation (CV) | 0.2723788855 |
| Kurtosis | 0.6165548282 |
| Mean | 10.23702176 |
| Median Absolute Deviation (MAD) | 1.8 |
| Skewness | -0.2244342469 |
| Sum | 27291.9 |
| Variance | 7.774887787 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 10 | 54 | 2.0% | |
| 10.2 | 47 | 1.8% | |
| 9.8 | 45 | 1.7% | |
| 11.5 | 43 | 1.6% | |
| 9.1 | 42 | 1.6% | |
| 11.3 | 42 | 1.6% | |
| 10.6 | 42 | 1.6% | |
| 9.7 | 41 | 1.5% | |
| 9.5 | 41 | 1.5% | |
| 10.9 | 41 | 1.5% | |
| Other values (148) | 2228 | 83.6% |
| Value | Count | Frequency (%) | |
| 0 | 15 | 0.6% | |
| 1.1 | 1 | < 0.1% | |
| 1.3 | 1 | < 0.1% | |
| 2.1 | 1 | < 0.1% | |
| 2.2 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 20 | 1 | < 0.1% | |
| 18.9 | 1 | < 0.1% | |
| 18.4 | 1 | < 0.1% | |
| 18.2 | 2 | 0.1% | |
| 18 | 2 | 0.1% |
Total intl calls
Real number (ℝ≥0)
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.8% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.467366842 |
|---|---|
| Minimum | 0 |
| Maximum | 20 |
| Zeros | 15 |
| Zeros (%) | 0.6% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 3 |
| median | 4 |
| Q3 | 6 |
| 95-th percentile | 9 |
| Maximum | 20 |
| Range | 20 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 2.456194903 |
|---|---|
| Coefficient of variation (CV) | 0.5498081958 |
| Kurtosis | 3.266618782 |
| Mean | 4.467366842 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.358768517 |
| Sum | 11910 |
| Variance | 6.032893402 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 3 | 544 | 20.4% | |
| 4 | 503 | 18.9% | |
| 2 | 388 | 14.6% | |
| 5 | 376 | 14.1% | |
| 6 | 267 | 10.0% | |
| 7 | 172 | 6.5% | |
| 1 | 125 | 4.7% | |
| 8 | 90 | 3.4% | |
| 9 | 83 | 3.1% | |
| 10 | 37 | 1.4% | |
| Other values (11) | 81 | 3.0% |
| Value | Count | Frequency (%) | |
| 0 | 15 | 0.6% | |
| 1 | 125 | 4.7% | |
| 2 | 388 | 14.6% | |
| 3 | 544 | 20.4% | |
| 4 | 503 | 18.9% |
| Value | Count | Frequency (%) | |
| 20 | 1 | < 0.1% | |
| 19 | 1 | < 0.1% | |
| 18 | 2 | 0.1% | |
| 17 | 1 | < 0.1% | |
| 16 | 2 | 0.1% |
Total intl charge
Real number (ℝ≥0)
| Distinct | 158 |
|---|---|
| Distinct (%) | 5.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.764489872 |
|---|---|
| Minimum | 0 |
| Maximum | 5.4 |
| Zeros | 15 |
| Zeros (%) | 0.6% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.57 |
| Q1 | 2.3 |
| median | 2.75 |
| Q3 | 3.27 |
| 95-th percentile | 3.97 |
| Maximum | 5.4 |
| Range | 5.4 |
| Interquartile range (IQR) | 0.97 |
Descriptive statistics
| Standard deviation | 0.7528120531 |
|---|---|
| Coefficient of variation (CV) | 0.2723149976 |
| Kurtosis | 0.6175371435 |
| Mean | 2.764489872 |
| Median Absolute Deviation (MAD) | 0.49 |
| Skewness | -0.2245685267 |
| Sum | 7370.13 |
| Variance | 0.5667259873 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 2.7 | 54 | 2.0% | |
| 2.75 | 47 | 1.8% | |
| 2.65 | 45 | 1.7% | |
| 3.11 | 43 | 1.6% | |
| 2.46 | 42 | 1.6% | |
| 2.86 | 42 | 1.6% | |
| 3.05 | 42 | 1.6% | |
| 2.97 | 41 | 1.5% | |
| 2.57 | 41 | 1.5% | |
| 3.08 | 41 | 1.5% | |
| Other values (148) | 2228 | 83.6% |
| Value | Count | Frequency (%) | |
| 0 | 15 | 0.6% | |
| 0.3 | 1 | < 0.1% | |
| 0.35 | 1 | < 0.1% | |
| 0.57 | 1 | < 0.1% | |
| 0.59 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 5.4 | 1 | < 0.1% | |
| 5.1 | 1 | < 0.1% | |
| 4.97 | 1 | < 0.1% | |
| 4.91 | 2 | 0.1% | |
| 4.86 | 2 | 0.1% |
Customer service calls
Real number (ℝ≥0)
| Distinct | 10 |
|---|---|
| Distinct (%) | 0.4% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.56264066 |
|---|---|
| Minimum | 0 |
| Maximum | 9 |
| Zeros | 555 |
| Zeros (%) | 20.8% |
| Memory size | 20.8 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 1 |
| Q3 | 2 |
| 95-th percentile | 4 |
| Maximum | 9 |
| Range | 9 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 1.311235759 |
|---|---|
| Coefficient of variation (CV) | 0.8391153465 |
| Kurtosis | 1.813987028 |
| Mean | 1.56264066 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.095176262 |
| Sum | 4166 |
| Variance | 1.719339216 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) | |
| 1 | 945 | 35.4% | |
| 2 | 608 | 22.8% | |
| 0 | 555 | 20.8% | |
| 3 | 348 | 13.1% | |
| 4 | 133 | 5.0% | |
| 5 | 49 | 1.8% | |
| 6 | 17 | 0.6% | |
| 7 | 8 | 0.3% | |
| 9 | 2 | 0.1% | |
| 8 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0 | 555 | 20.8% | |
| 1 | 945 | 35.4% | |
| 2 | 608 | 22.8% | |
| 3 | 348 | 13.1% | |
| 4 | 133 | 5.0% |
| Value | Count | Frequency (%) | |
| 9 | 2 | 0.1% | |
| 8 | 1 | < 0.1% | |
| 7 | 8 | 0.3% | |
| 6 | 17 | 0.6% | |
| 5 | 49 | 1.8% |
Churn
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.6 KiB |
| Value | Count | Frequency (%) | |
| False | 2278 | 85.4% | |
| True | 388 | 14.6% |
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
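The definition above translates directly into a few lines of code. The following is a minimal pure-Python sketch, not the report's own implementation; the function name `pearson_r` and the sample data are illustrative. It also checks the invariance property by shifting and rescaling both variables with positive scale factors.

```python
import math

def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    # The 1/n (or 1/(n-1)) factors cancel between numerator and denominator.
    return cov / math.sqrt(var_x * var_y)

# Illustrative data, not taken from the dataset.
x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2.1, 3.9, 6.2, 8.1, 9.8]

r = pearson_r(x, y)
# Shift and rescale both variables (positive scale factors): r is unchanged.
r_transformed = pearson_r([10 * a + 3 for a in x], [0.5 * b - 7 for b in y])
print(r, r_transformed)
```

A negative scale factor on one variable would flip the sign of r while leaving its magnitude unchanged.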
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
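In other words, ρ is Pearson's r applied to ranks. The sketch below illustrates this under the assumption that tied values receive the average of their rank positions; the helper names and the sample data are illustrative. It uses a relationship that is monotonic but strongly nonlinear to show the difference from Pearson's r.

```python
import math

def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def pearson_r(x, y):
    """Covariance divided by the product of standard deviations."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    return cov / math.sqrt(sum((a - mx) ** 2 for a in x) * sum((b - my) ** 2 for b in y))

def spearman_rho(x, y):
    """Pearson's r computed on the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))

# Illustrative data: y grows monotonically but not linearly with x.
x = [1, 2, 3, 4, 5, 6]
y = [math.exp(v) for v in x]
print("pearson :", pearson_r(x, y))
print("spearman:", spearman_rho(x, y))
```

Because the ranks of x and y coincide here, ρ reaches its maximum while r stays noticeably below it.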
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
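The pair-counting description can be sketched as follows. This is the simple form exactly as stated above (often called tau-a), where tied pairs count as neither concordant nor discordant. Tie-corrected variants differ when ties are present, and the report does not say which variant it computes. The function name and the data are illustrative.

```python
from itertools import combinations

def kendall_tau(x, y):
    """(concordant - discordant) / total number of pairs, with no tie correction."""
    concordant = discordant = 0
    pairs = list(combinations(range(len(x)), 2))
    for i, j in pairs:
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1      # both variables move the same way
        elif direction < 0:
            discordant += 1      # the variables move in opposite ways
        # direction == 0 is a tie and contributes to neither count
    return (concordant - discordant) / len(pairs)

# Illustrative data: one swapped pair out of six.
x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
tau = kendall_tau(x, y)
print(tau)
```

With four observations there are six pairs; five are concordant and one is discordant, so τ = (5 - 1) / 6.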
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available for it.
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
First rows
| | State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | KS | 128 | 415 | No | Yes | 25 | 265.1 | 110 | 45.07 | 197.4 | 99 | 16.78 | 244.7 | 91 | 11.01 | 10.0 | 3 | 2.70 | 1 | False |
| 1 | OH | 107 | 415 | No | Yes | 26 | 161.6 | 123 | 27.47 | 195.5 | 103 | 16.62 | 254.4 | 103 | 11.45 | 13.7 | 3 | 3.70 | 1 | False |
| 2 | NJ | 137 | 415 | No | No | 0 | 243.4 | 114 | 41.38 | 121.2 | 110 | 10.30 | 162.6 | 104 | 7.32 | 12.2 | 5 | 3.29 | 0 | False |
| 3 | OH | 84 | 408 | Yes | No | 0 | 299.4 | 71 | 50.90 | 61.9 | 88 | 5.26 | 196.9 | 89 | 8.86 | 6.6 | 7 | 1.78 | 2 | False |
| 4 | OK | 75 | 415 | Yes | No | 0 | 166.7 | 113 | 28.34 | 148.3 | 122 | 12.61 | 186.9 | 121 | 8.41 | 10.1 | 3 | 2.73 | 3 | False |
| 5 | AL | 118 | 510 | Yes | No | 0 | 223.4 | 98 | 37.98 | 220.6 | 101 | 18.75 | 203.9 | 118 | 9.18 | 6.3 | 6 | 1.70 | 0 | False |
| 6 | MA | 121 | 510 | No | Yes | 24 | 218.2 | 88 | 37.09 | 348.5 | 108 | 29.62 | 212.6 | 118 | 9.57 | 7.5 | 7 | 2.03 | 3 | False |
| 7 | MO | 147 | 415 | Yes | No | 0 | 157.0 | 79 | 26.69 | 103.1 | 94 | 8.76 | 211.8 | 96 | 9.53 | 7.1 | 6 | 1.92 | 0 | False |
| 8 | WV | 141 | 415 | Yes | Yes | 37 | 258.6 | 84 | 43.96 | 222.0 | 111 | 18.87 | 326.4 | 97 | 14.69 | 11.2 | 5 | 3.02 | 0 | False |
| 9 | RI | 74 | 415 | No | No | 0 | 187.7 | 127 | 31.91 | 163.4 | 148 | 13.89 | 196.0 | 94 | 8.82 | 9.1 | 5 | 2.46 | 0 | False |
Last rows
| | State | Account length | Area code | International plan | Voice mail plan | Number vmail messages | Total day minutes | Total day calls | Total day charge | Total eve minutes | Total eve calls | Total eve charge | Total night minutes | Total night calls | Total night charge | Total intl minutes | Total intl calls | Total intl charge | Customer service calls | Churn |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2656 | GA | 122 | 510 | Yes | No | 0 | 140.0 | 101 | 23.80 | 196.4 | 77 | 16.69 | 120.1 | 133 | 5.40 | 9.7 | 4 | 2.62 | 4 | True |
| 2657 | MD | 62 | 408 | No | No | 0 | 321.1 | 105 | 54.59 | 265.5 | 122 | 22.57 | 180.5 | 72 | 8.12 | 11.5 | 2 | 3.11 | 4 | True |
| 2658 | IN | 117 | 415 | No | No | 0 | 118.4 | 126 | 20.13 | 249.3 | 97 | 21.19 | 227.0 | 56 | 10.22 | 13.6 | 3 | 3.67 | 5 | True |
| 2659 | OH | 78 | 408 | No | No | 0 | 193.4 | 99 | 32.88 | 116.9 | 88 | 9.94 | 243.3 | 109 | 10.95 | 9.3 | 4 | 2.51 | 2 | False |
| 2660 | OH | 96 | 415 | No | No | 0 | 106.6 | 128 | 18.12 | 284.8 | 87 | 24.21 | 178.9 | 92 | 8.05 | 14.9 | 7 | 4.02 | 1 | False |
| 2661 | SC | 79 | 415 | No | No | 0 | 134.7 | 98 | 22.90 | 189.7 | 68 | 16.12 | 221.4 | 128 | 9.96 | 11.8 | 5 | 3.19 | 2 | False |
| 2662 | AZ | 192 | 415 | No | Yes | 36 | 156.2 | 77 | 26.55 | 215.5 | 126 | 18.32 | 279.1 | 83 | 12.56 | 9.9 | 6 | 2.67 | 2 | False |
| 2663 | WV | 68 | 415 | No | No | 0 | 231.1 | 57 | 39.29 | 153.4 | 55 | 13.04 | 191.3 | 123 | 8.61 | 9.6 | 4 | 2.59 | 3 | False |
| 2664 | RI | 28 | 510 | No | No | 0 | 180.8 | 109 | 30.74 | 288.8 | 58 | 24.55 | 191.9 | 91 | 8.64 | 14.1 | 6 | 3.81 | 2 | False |
| 2665 | TN | 74 | 415 | No | Yes | 25 | 234.4 | 113 | 39.85 | 265.9 | 82 | 22.60 | 241.4 | 77 | 10.86 | 13.7 | 4 | 3.70 | 0 | False |